Phonetic Ambiguity : Approaches, Touchstones, Pitfalls and New Approaches
نویسنده
چکیده
Phonetic ambiguity and confusibility are bugbears for any form of bottom-up or data-driven approach to language processing. The question of when an input is “close enough” to a target word pervades the entire problem spaces of speech recognition, synthesis, language acquisition, speech compression, and language representation, but the variety of representations that have been applied are demonstrably inadequate to at least some aspects of the problem. This paper reviews this inadequacy by examining several touchstone models in phonetic ambiguity and relating them to the problems they were designed to solve. An good solution would be, among other things, efficient, accurate, precise, and universally applicable to representation of words, ideally usable as a “phonetic distance” metric for direct measurement of the “distance” between word or utterance pairs. None of the proposed models can provide a complete solution to the problem; in general, there is no algorithmic theory of phonetic distance. It is unclear whether this is a weakness of our representational technology or a more fundamental difficulty with the problem statement. In any case, these results show that the representations can be as crucial as the system architecture, and that as much or more creativity is required to properly represent language as to process it.
منابع مشابه
Touchstones, Pitfalls, and New Directions
Phonetic ambiguity and confusibility are bugbears for any form of bottom-up or data-driven approach to language processing. The question of when an input is “close enough” to a target word pervades the entire problem spaces of speech recognition, synthesis, language acquisition, speech compression, and language representation, but the variety of representations that have been applied are demons...
متن کاملAnalysis of Phonetic Matching Approaches for Indic Languages
Phonetic matching plays an important role in multilingual information retrieval, where data is manipulated in multiple languages. User needs information in their local language which may be different from the language where data has been maintained. In such an environment, we need a system which matches the strings phonetically irrespective of errors either exactly or approximately. There are m...
متن کاملبررسی مقایسهای نظام قدیم و جدید آموزش مدیران و کارکنان دولت با نگاه استراتژیک
While reviewing the history and importance of traning state executives and employees, this article surveys the old and new on- the- job training for the said groups. The study primarily reviews the objectives and plans of each approach. Then the pitfalls of the first approach are enumerated, as studied by the State Management and Planning Organization's research team. The two approaches are com...
متن کاملGrey theory, VIKOR and TOPSIS Approaches
Abstract This author introduces the concept of Stepwise Strategy Approach (SSA) for dealing with a number of problems arises in the current age of technology. This new idea is combined with the knowledge of Grey Theory for adding flexibility to decision making process. Grey theory is useful for grasping the ambiguity exists in the utilized information and the fuzziness appears in the human judg...
متن کاملمقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره cmp-lg/9608020 شماره
صفحات -
تاریخ انتشار 1996